Predicting the cofactors of oxidoreductases based on amino acid composition distribution and Chou's amphiphilic pseudo-amino acid composition

Guang-Ya Zhang; Bai-Shan Fang

doi:10.1016/j.jtbi.2008.03.015

Predicting the cofactors of oxidoreductases based on amino acid composition distribution and Chou's amphiphilic pseudo-amino acid composition

J Theor Biol. 2008 Jul 21;253(2):310-5. doi: 10.1016/j.jtbi.2008.03.015. Epub 2008 Mar 19.

Authors

Guang-Ya Zhang¹, Bai-Shan Fang

Affiliation

¹ Institute of Industrial Biotechnology, Huaqiao University, Quanzhou 362021, Fujian, PR China. zhgyghh@hqu.edu.cn

PMID: 18471832
DOI: 10.1016/j.jtbi.2008.03.015

Abstract

Predicting the cofactors of oxidoreductases plays an important role in inferring their catalytic mechanism. Feature extraction is a critical part in the prediction systems, requiring raw sequence data to be transformed into appropriate numerical feature vectors while minimizing information loss. In this paper, we present an amino acid composition distribution method for extracting useful features from primary sequence, and the k-nearest neighbor was used as the classifier. The overall prediction accuracy evaluated by the 10-fold cross-validation reached 90.74%. Comparing our method with other eight feature extraction methods, the improvement of the overall prediction accuracy ranged from 3.49% to 15.74%. Our experimental results confirm that the method we proposed is very useful and may be used for other bioinformatical predictions. Interestingly, when features extracted by our method and Chou's amphiphilic pseudo-amino acid composition were combined, the overall accuracy could reach 92.53%.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Amino Acids / analysis*
Coenzymes / chemistry*
Databases, Protein
Models, Chemical
Oxidoreductases / chemistry*

Substances

Amino Acids
Coenzymes
Oxidoreductases